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The severe acute respiratory syndrome (SARS) coro- 
navirus (CoV) main protease represents an attractive 
target for the development of novel anti-SARS agents. 
The tertiary structure of the protease consists of two 
distinct folds. One is the N-terminal chymotrypsin-like 
fold that consists of two structural domains and consti- 
tutes the catalytic machinery; the other is the C-termi- 
nal helical domain, which has an unclear function and is 
not found in other RNA virus main proteases. To under- 
stand the functional roles of the two structural parts of 
the SARS-CoV main protease, we generated the full- 
length of this enzyme as well as several terminally trun- 
cated forms, different from each other only by the num- 
ber of amino acid residues at the C- or N-terminal 
regions. The quaternary structure and K, value of the 
protease were analyzed by analytical ultracentrifuga- 
tion. The results showed that the N-terminal 1-3 amino 
acid-truncated protease maintains 76% of enzyme activ- 
ity and that the major form is a dimer, as in the wild 
type. However, the amino acids 1-4-truncated protease 
showed the major form to be a monomer and had little 
enzyme activity. As a result, the fourth amino acid 
seemed to have a powerful effect on the quaternary 
structure and activity of this protease. The last C-termi- 
nal helically truncated protease also exhibited a greater 
tendency to form monomer and showed little activity. 
We concluded that both the C- and the N-terminal re- 
gions influence the dimerization and enzyme activity of 
the SARS-CoV main protease. 


Severe acute respiratory syndrome (SARS)? is an acute res- 
piratory illness that causes an atypical and highly contagious 
pneumonia. It has spread in many countries and areas. The 
international outbreak of this disease resulted in great damage 
to public health (1-3). A novel form of coronavirus (CoV), 
SARS-CoV, has been identified to be the major cause of 
SARS (3-8). 

Coronaviruses are positive-sense, single-stranded RNA vi- 
ruses. The genome of SARS-CoV is ~30,000 nucleotides in 
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length, and the organization is similar to that of other corona- 
viruses. The replicase gene encodes two overlapping polypro- 
teins, polyprotein la (~450 kDa) and polyprotein lab (~750 
kDa). The polyproteins are cleaved by the internally encoded 
main protease (MP*°, 3CL), which is required for the production 
of new infectious viruses. The main protease represents an 
attractive target for the development of novel anti-viral agents 
due to the functional importance of this enzyme in the viral life 
cycle (9-12). 

The crystal structures of reported CoV, including the SARS- 
CoV main proteases, are homodimers (138-15). Each protomer 
of the main protease is composed of three structural domains 
(Fig. 1). The first two domains of the SARS-CoV main protease 
have an antiparallel B-barrel structure, which is similar to the 
other CoV proteases, and form a chymotrypsin-like fold respon- 
sible for catalytic reactions (15). The active site containing a 
catalytic dyad defined by His-41 and Cys-145 is located be- 
tween domains I and II. The third domain contains five a-hel- 
ices with an unclear biological function. The domain III of one 
protomer and the domain II of another form a contacting region 
in the dimer. The N terminus (N-finger containing amino acid 
residues 1-7) is seated at this region and plays an important 
role in dimerization (16). 

We have demonstrated that the major quaternary structure 
of SARS-CoV main protease at neutral pH is a dimer, which is 
the catalytically competent form (17). It is ultimately impor- 
tant to understand the factors that control dimerization, as the 
dissociated monomer could be enzymatically inactive. The C- 
terminal helical domain interacts with the active site of an- 
other protomer in the dimer and switches the enzyme molecule 
from the inactive form to the active form (18). The structural 
and biochemical data also show that the N-terminal residues 
1-7 play an important role in the dimerization and formation of 
the active site of SARS main protease (15). N-terminal trunca- 
tion of the whole N-finger, A(1—7) results in almost complete 
loss of enzymatic activity (19). 

In this report, we studied critically the functional role of the 
N and C termini by serial truncations. We report the stability 
and structure-function relationship of the full-length SARS- 
CoV main protease in comparison with the various truncated 
forms. Our results demonstrate that both N- and C-terminal 
regions are involved in the enzyme activity as well as in dimer- 
ization. We have narrowed down the critical amino acid resi- 
dues to the fourth amino acid residue of the N-terminal and the 
last helical amino acids of the C-terminal region as those in- 
volved in dimerization to give a correct conformation of the 
active site. 


EXPERIMENTAL PROCEDURES 


Construction of WT and N- and C-terminally Truncated SARS-CoV 
Main Proteases—The plasmids containing the full-length main protease 
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Domain | 


A(1-3) 
A(1-4) 
A(1-5) 
A(1-6) 
A(\-7) 
A(293-306) 
(201-306) 


Protomer A 


Domain II 


SARS-CoV Main Protease 


Domain III 


Fic. 1. Structural features of the dimeric SARS-CoV main protease. A, secondary structure of the enzyme and the deleted regions of the 
various deletion mutants. B, the enzyme structure (Protein Data Bank code 1uk3) is presented as ribbons and labeled as to coincide with the stick 
model. Protomer A is colored with the catalytic domains I and II in blue, and the helical domain III in purple. The N-finger residues and the last 
a-helix of protomer A are shown in stick model form in blue and CPK (oxygen in red, nitrogen in blue, sulfur in yellow, and gray for all other atoms), 
respectively. The catalytic dyad His-41 and Cys-145 are in red. The surface of protomer B is shown in mesh model form. This figure was generated 
with Spock software. C, the backbone of the enzyme is shown as a stick model in blue and green, and the major pockets are shown as a space-filling 


model. This figure was generated with CASTp (39). 


were kindly provided by Dr. Shao-Hung Wang (Genome Research Cen- 
ter, National Yang-Ming University, Taipei, Taiwan). The genes of the 
full-length SARS-CoV main protease were amplified by polymerase 
chain reaction with appropriate primers. The forward primer for the 
full-length WT SARS-CoV main protease was 5'-GGTGGTCATAT- 
GAGTGGTTTTAGG-3’, and the reverse primer was 5'-AACTCGAGGG- 
TAACACCAGAG-3’. After digestion with BglII and Xhol, the PCR 
product was cut into two fragments, 168 and 747 bp. The 168-bp 
fragment was then digested with Ndel. Finally, the 168-bp NdeI-BglIT 
and 747-bp BglII-Xhol fragments were co-ligated to the NdeI and Xhol 
sites of the vector pET-29a(+) (Novagen, Madison, WI). 

The N-terminally truncated proteases were made by PCR. The 
forward primers were as follows: A(1-3), 5'-GGAGATATACATATGAG- 
GAAAATGGCATTC-3’; A(1—4), 5'’-GGAGATATACATATGAAAATGG- 
CATTCCCG-3’; A(1-5), 5’-GGAGATATACATATGATGGCATTCCCGT- 
CA-3'; A(1-6), 5'-GGAGATATACATATGGCATTCCCGTCAGGC-3’; 
and A(1-7), 5’-GGAGATATACATATGTTCCCGTCAGGCAAA-3'. The 
reverse mutagenic primers were as follows: A(1-3), 5’-GAATGCCATT- 
TTCCTCATATGTATATCTCC-3’; A(1—4), 5'’-CGGGAATGCCATTTTC- 
ATATGTATATCTCC-3’; A(1-5), 5'-TGACGGGAATGCCATCATATGT- 
ATATCTCC-3’; A(1-6), 5'-GCCTGACGGGAATGCCATATGTATATCT- 
CC-3'; and A(1-7), 5'-TTTGCCTGACGGGAACATATGTATATCTCC-3’. 

The pET-SARS-CoV main protease vector was used as the template. 
The DNA polymerase Pfu (Promega, Madison, WI) extended and incor- 
porated the mutagenic primers in the process of PCR. After 16-18 
temperature cycles, the N-terminally truncated plasmid containing 
staggered nicks was generated. The PCR products were then treated 
with DpnI (New England Biolabs, Beverly, MA) to digest the template. 
Finally, the vector containing the protease cDNA with the desired 


mutation was transformed into Escherichia coli. The C-terminally trun- 
cated proteases were subsequently amplified by PCR using the follow- 
ing sequences. The forward primer, 5'-TGAAGATCTGCTCATTCGCA- 
A-3’; reverse primer of A(293-306), 5‘’-AACTCGACTGTAAACTCATC- 
TTC-3’; reverse primer of A(201-306), 5’-AACTCGACTATGGTTGTG- 
TCTG-3’. After digestion with BglII and XholI, the PCR products were 
inserted into the BglII and XhoI sites of the pET-SARS-CoV main 
protease. The DNA sequences of the full-length, N- and C-terminally 
truncated SARS-CoV main proteases were checked by autosequencing. 
The recombinant SARS-CoV main protease has a His tag at the C 
terminus. This His tag was not removed, because earlier reports 
seemed to rule out the possible effect of the His tag on the dimeric 
structure or enzyme activity (17). 

Expression and Purification of WT and N- and C-terminally Trun- 
cated SARS-CoV Main Proteases—The modified plasmids of the recom- 
binant proteases were transformed into the E. coli strain BL21 (DE3)- 
competent cells. The cells were grown at 37°C in Luria-Bertani 
medium with 50 ywg/ml kanamycin until the absorbance at 600 nm 
reached 0.8 and were then induced by 1 mM isopropyl-1-thio-B-D-galac- 
toside at 18 °C overnight. The cells were centrifuged at 5,000 x g, 4 °C 
for 10 min. The supernatant was removed, and the pelleted cells were 
then suspended in binding buffer (20 mm Tris-HCl, 300 mm NaCl, and 
2 mM BME, pH 7.6). The cells were sonicated for 10 min at 10-s burst 
cycles at 300 W with a 10-s cooling period between each burst. The cell 
debris was removed by centrifugation (10,000 x g at 4 °C for 25 min). 
One ml of binding buffer-equilibrated nickel-nitrilotriacetic acid slurry 
(Qiagen, Hilden, Germany) was then added to the soluble lysate, and 
the solution was mixed gently at 4 °C for 50 min to equilibrium. The 
lysate-nickel-nitrilotriacetic acid mixture was then loaded into a col- 
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Fic. 2. CD spectra of the full-length 
WT and truncated SARS-CoV main 
proteases. Far-UV CD spectra of all re- 
combinant SARS-CoV main proteases 
were monitored at a 0.8 mg/ml concentra- 
tion in 10 mm PBS buffer (pH 7.6) at 
25 °C, where deg is the ellipticity in 
degrees. 


SARS-CoV Main Protease 
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TABLE I 
Structural characteristics of the recombinant SARS-CoV main proteases in crystal and solution 
Protease a-Helix B-Strand Turns Unordered pe oe Dee (ry? 
°C. nm 
1uk3* 22 25 20 33 
WT 21 27 22 30 0.030 54.8 342.1 
A(1-3) 20 25 21 34 0.023 54.8 341.8 
A(1-4) 22 23 19 35 0.050 52.0 341.8 
A(1-5) 22 26 19 33 0.028 53.4 341.8 
A(1-6) 21 26 18 34 0.025 51.5 ND* 
A(1-7) 21 28 20 31 0.024 54.6 342.1 
A(293-306) 24 27 19 30 0.024 48.4 342.1 
A(201-306) 4 31 21 43 0.140 55.0 331.1 


“ Crystal structure determined by x-ray crystallography (Protein Data Bank code 1uk3). 


’ Normalized root mean S.D. of the secondary structure fitting results. 
° Melting temperature determined by the CD spectropolarimeter. 


¢ Average fluorescence emission wavelength shift. 


© Not determined. 
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umn and washed with the washing buffer (20 mm imidazole, 20 mm 
Tris-HCl, 300 mm NaCl, and 2 mm BME, pH 7.6). Finally, the protease 
was eluted with elution buffer (400 mM imidazole, 20 mm Tris-HCl, 300 
mM NaCl, and 2 mm BME, pH 7.6). The purified protein was then 
concentrated at 4 °C using Amicon Ultra-4 centrifugal filter units (Mil- 


WT 

A(1-3) 
A(1-4) 
A(1-5) 
A(1-7) 
A(293-306) 
A(201-306) 


Fic. 3. Fluorescence spectra of the 
full-length WT and truncated SARS- 
CoV main proteases. Fluorescence 
emission spectra of all recombinant pro- 
teases were monitored at 8 ug/ml concen- 
tration in 10 mm PBS buffer (pH 7.6) at 
25 °C. 


lipore, Bedford, MA) with molecular mass cutoff at 10 kDa. The purified 
protein was concentrated to 5-15 mg/ml and then diluted to 0.5-3 
mg/ml by 10 mM PBS (containing 10 mM sodium phosphate buffer, 150 
mM NaCl, 2 mm BME, pH 7.6), which was used to replace the elution 
buffer over six concentration-dilution cycles. The sample from the pu- 
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rification step was separated on a 4-12% gradient sodium dodecyl 
sulfate polyacrylamide gel to check the homogeneity. 

Circular Dichroism (CD) and Fluorescence Analyses—CD experi- 
ments were performed in a Jasco J-810 spectropolarimeter (Tokyo, 
Japan) equipped with a Neslab RTE-111 water-circulated thermal con- 
troller. The samples were prepared in 10 mm PBS solution at pH 7.6 
with a protein concentration of 0.5 mg/ml. Far-UV CD spectra from 250 
to 190 nm were collected using a 0.01-cm path-length cuvette with a 
0.1-nm spectral resolution at 25 °C. Ten independent scans were aver- 
aged for each sample. All spectra were corrected for buffer contributions 
and converted to mean residue ellipticity ([O]). The [O] at each wave- 
length was calculated from Equation 1, 


MRW:0, 


[Ol Toke 


(Eq. 1) 
where MRW is the mean residue weight (a value of 111.3 was used for 
the WT), 6, is the measured ellipticity in degree at wavelength A, / is the 
cuvette path length (0.01 cm), and c is the protein concentration in g/ml. 
The secondary structure analysis was performed by DICHROWEB (20, 
21), which provides an interactive web site server allowing the decon- 
volution of data from circular dichroism spectroscopy experiments (pub- 
lic-1.cryst.bbk.ac.uk/cdweb/html/). DICHROWEB offers several impor- 
tant types of software, such as CDSSTR (22-24), CONTINLL (25, 26), 
SELCONS (27, 28), and K2D (29). 

Thermal stability of the WT and various truncated mutants were 
analyzed with a spectropolarimeter by monitoring the 222-nm circular 
dichroism at the temperature range between 30 and 90 °C, and the 
temperature at which half of the protein molecules were unfolded was 
recorded (T.,,). 

Fluorescence experiments were performed in a PerkinElmer Life 
Sciences 50B luminescence spectrometer (Beaconsfield, Backingham- 
shire, England). The sample was prepared in 10 mm PBS solution at pH 
7.6 with a protein concentration of 8 ug/ml. The fluorescence emission 
spectra from 300 to 400 nm were collected after excitation at 280 nm. 
Fluorescence spectra of proteins were determined with a 1-cm path 
quartz cuvette at 25 °C. The spectral bandwidth was 5 nm for excitation 
and 10 nm for emission. All spectra were corrected for the buffer 
contribution. The average emission wavelength ((A)) was calculated 
from Equation 2 (30), 


AN 


SFA) 


i= 


(4) = 


7 


i=A1 


(Eq. 2) 


where F is the fluorescence intensity and A is the wavelength. 

Analytical Ultracentrifugation Analysis—The molar mass and sedi- 
mentation coefficient of the proteases were analyzed by a sedimentation 
velocity experiment. It was performed on a Beckman Optima XL-A 
analytical Ultracentrifuge (Fullerton, CA). Prior to the experiments, 
the sample was diluted to various protein concentrations with 10 mm 
PBS buffer at pH 7.6. Sample (400 pl) and buffer (440 pl) solutions were 
loaded into the double sector centerpiece separately. All experiments 
were carried out at 20 °C with an An50 rotor at the speed of 42,000 
revolutions/min. The proteins were measured by UV absorbance at 280 
nm in a continuous mode with a time interval of 480 s. The recorded 
scans at different time points were collected and fitted to a continuous 
size distribution model by the SEDFIT program (31-34). The observed 
sedimentation profiles of a continuous size distribution (c(s)) can be 
calculated from Equation 3, 


a(r,t) = [evottsDonas +e (Eq. 3) 


where a(r,t) denotes the experimentally observed signal, L(s,D,r,t) de- 
notes the solution of the Lamm equation for a single species (35), and e 
is the noise component. 

For a precise determination of the monomer-dimer equilibrium of the 
SARS-CoV main protease, the sedimentation velocity experiment was 
performed at three different protein concentrations, and all sedimenta- 
tion data were subjected to the monomer-dimer equilibrium model 
fitting. The partial specific volume of the protease, solvent density, and 
viscosity were calculated by the software program SEDNTERP (36). 
The dissociation constant (K,) was calculated by the global modeling of 
the SEDPHAT program (33). 
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Fic. 4. Continuous sedimentation coefficient distribution of 
the full-length WT and N-terminally truncated SARS-CoV main 
protease. The residual bitmaps of various main proteases are shown in 
the insets. All enzyme preparations used a concentration of 1 mg/ml in 
10 mm PBS buffer (pH 7.6) at 20 °C. A, WT; B, A(1-3); C, A(1—4); D, 
A(1-5); #, A—6); F, A(—7). The left dotted line indicates the monomer, 
and the right one is the dimer form. 
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Enzymatic Activity Assay of the SARS-CoV Main Protease Using a 
Fluorogenic Substrate—The kinetic measurements of the SARS-CoV 
main protease activity were performed in 10 mm PBS with 2 mm BME 
at 30 °C. The reaction was initiated by adding 12 wg of WT and 1.5—2.0 
mg of mutants in 1 ml of reaction mixture. Enhanced fluorescence due 
to cleavage of the internally quenched fluorogenic substrate peptides 
(ortho-aminobenzoic acid-TSAVLQSGFRK-2,4-dinitrophenylamide) by 
protease was monitored at 420 nm with excitation at 362 nm using a 
PerkinElmer Life Sciences 50B luminescence spectrometer. 

The mixture containing N-terminal peptides (ortho-aminobenzoic ac- 
id-TSAVLQ) and C-terminal peptides (SQFRK-2,4-dinitrophenylamide) 
of different concentrations were prepared to monitor the specific fluoro- 
genic intensity. All intensities were corrected for the buffer contribu- 
tions. The serial intensities at 420 nm were used to construct a stand- 
ard curve for quantifying the product. In this way, the enzyme activity 
could be precisely determined. 


RESULTS 


Expression, Purification, and Characterization of the Recom- 
binant SARS-CoV Main Proteases—The full-length and trun- 
cated SARS-CoV main proteases have been successfully ex- 
pressed in E. coli and purified by a single affinity column. All 
the recombinant SARS-CoV main proteases were found in the 
soluble fraction of the cell lysate. The expressed SARS-CoV 
main proteases bound to the nickel column, but other pro- 
teins flowed through the column as refuse. SDS-PAGE anal- 
ysis indicated that the recombinant proteins were almost 
homogeneous in solution. All purified proteins had M, in 
agreement with the theoretical values. After concentration, 
5-15 mg/ml recombinant main protease could be obtained 
from 200 ml of cells. Unfortunately, the other C-terminally 
truncated mutants were not successfully expressed, probably 
because of their instabilities. 

To determine the secondary structure of the successfully 
expressed and purified recombined main proteases, far-UV CD 
spectra were recorded. The overall CD spectra were shown in 
Fig. 2. The spectra of all recombinant SARS-CoV main pro- 
teases seemed to be similar, except that of A(201—306) as an- 
ticipated (Fig. 1A). These results indicated that the proteins 
have a well defined secondary structure. This is reflected in the 
secondary structural estimation by the DICHROWEB server 
(20, 21). The CDSSTR analysis is shown in Table I. The nor- 
malized root mean square deviation values of the data fitting 
for WT and various truncations were all <0.2 and thus showed 
excellent goodness-of-fit parameters (37). The analysis of the 
secondary structure of the full-length SARS-CoV main protease 
is consistent with the data derived from the crystal structure, 
1uk3 (Table I). The helical contents of the recombinant WT and 
crystal structure (luk3) were 0.21 and 0.22, respectively. All 
truncations showed similar results with full-length main pro- 
tease, except that A(201-306) had significant low a-helix con- 
tent, which was in agreement with the structure in which the 
whole helical domain III was deleted. The thermal stability of 
the recombinant SARS-CoV main proteases was also examined 
(Table I). Among the truncated SARS-CoV main proteases, the 
C-terminally truncated protease A(293—306) has significantly 
lower T,,, than WT. 

The fluorescence emission spectra of the recombinant SARS- 
CoV main proteases were shown in Fig. 3. Only A(201—306) had 
significantly low fluorescence intensity. The average emission 
wavelengths were calculated by the method of Sanchez del Pino 
and Fersht (30), which accounts for both wavelength shift and 
the fluorescence intensity attenuation. The average emission 
wavelength of the full-length SARS-CoV main protease is 342 
nm. With the only exception of A(201-306), which has an emis- 
sion wavelength 11 nm lower than WT, other truncations show 
minor differences (Table I). 

Analytical Ultracentrifugation Analysis—Analytical ultra- 
centrifugation was performed to investigate the association 
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Fic. 5. Continuous sedimentation coefficient distribution of 
the full-length WT and C-terminally truncated SARS-CoV main 
protease. The residual bitmaps of various main proteases are shown in 
the insets. All enzyme preparations used a concentration of 1 mg/ml in 
10 mM PBS buffer (pH 7.6) at 20 °C. A, WT; B, A(293-306); C, A(201-— 
306). The left dotted line indicates the monomer, and the right one is the 
dimer form. 


states of WT and truncated SARS-CoV main proteases. This 
method was successfully used to demonstrate the dimerization 
of SARS-CoV main protease under various conditions (17). The 
data were analyzed by continuous size distribution, which im- 
plemented a highly reliable model, as indicated by the homo- 
geneous bitmap picture (Figs. 4 and 5, insets). All of these data 
were derived from an excellent matching curve of the original 
raw sedimentation data and the randomly distributed residual 
values (data not shown). WT protease shows a monomer-dimer 
equilibrium in solution. The sedimentation coefficients of 2.45 
and 4.2 S were monomer and dimer, respectively, correspond- 
ing to species with molar mass measurements of 34 and 68 kDa 
(17). WT and N- and C-terminally truncated proteases also 
displayed a mixture of monomer and dimer (Figs. 4 and 5). 
Deletion of three residues from the N terminus showed a sim- 
ilar monomer-dimer distribution with WT. The monomer be- 
came the major species when the fourth amino acid residue was 
deleted from the N terminus (Fig. 4). Further deletion of more 
residues (A(1-5), A(1-6), and A(1—7)) from the N terminus 
showed a similar pattern with A(1—4). In addition, we have also 
examined the involvement of domain III in the subunit associ- 
ation. Truncation of the last helix, the A(293-306) mutant, 
caused the SARS-CoV main protease to become a monomer 
(Fig. 5). These results clearly indicated that residues 4 and 
293-306 are critically involved in stabilizing the dimer 
structure. 

The influence of individual residues or domains in the sub- 
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Fic. 6. Global analysis of the sedimentation velocity data of full-length WT SARS-CoV main protease at three protein concentra- 
tions. Sedimentation was performed at 20 °C with an An50 rotor and at rotor speeds of 42,000 revolutions/min. A—C, the concentrations of the 
protein were 0.1 mg/ml, 0.5 mg/ml, and 1 mg/ml, respectively. The sedimentation profiles are from the absorbance optical system at a wavelength 
of 280 nm. The symbols are the raw sedimentation data, and the lines are the theoretical fitted data to the Lamm equation implemented in the 
software SEDPHAT. D-F, the randomly distributed residuals of the fitting model from the upper panel provide a credible analysis result for a 


dissociation constant (K,) of the monomer-dimer equilibrium. 


unit interaction was further quantified by comparing the mon- 
omer-dimer dissociation constants. As mentioned above, the 
global analysis was employed to determine the K, value of WT 
and various truncated mutant proteases (Fig. 6). The K, value 
for monomer-dimer equilibrium of WT was measured to be 0.28 
pM (Table II). Sequential deletion of residues from the N ter- 
minus increased the K, value. The C-terminally truncated 
proteases have much higher K, values than the others. 

Kinetic Properties of the Protease—The deletion mutant of 
the related transmissible gastroenteritis virus main protease 
that lacks residues 1—5 is almost enzymatically inactive (13). 
The crystal structure of the SARS-CoV main protease reveals 
that the N-terminal residues 1-7 from subunit A are directly 
inserted into the active site of subunit B (15). We determined 
the enzyme activity of the full-length, sequential N- and C- 
terminally truncated proteases. The enzyme activity was meas- 
ured by peptide cleavage assay (17). The internally quenched 
fluorescent substrate is cleaved specifically at the Q-S peptide 
bond (38). The apparent K,,, value of WT measured was 17 + 1 
uM, and the apparent k,,, value was 198 + 22 s~*. The mutant 
main protease A(1-3) still possesses 76% of enzyme activity as 
compared with WT (Table IT). However, the enzyme activity of 
other truncated mutants was decreased to only 0.2-1.3% of WT 
activity. 


DISCUSSION 


The biophysical analyses of the SARS-CoV main protease 
and its truncated mutants performed here allowed a detailed 
structural characterization of the mutants compared with the 
full-length protease. In our present data, except the domain 
IlI-truncated protease, A(201—306), which has lower mean res- 
idue ellipticity and fluorescence intensity, the CD and fluores- 
cence emission spectra of other truncated proteases were sim- 
ilar to the full-length protease. With the secondary and tertiary 
structures as anticipated, we then tried to study the structure 
and function relationships of the N- and C-terminal regions of 
the SARS-CoV main protease, especially on the correlation 


TABLE II 
Quaternary structure and enzymatic activity of the full-length and 
truncated SARS-CoV main proteases 


Relative 
Protease Ki Fold RMSD?° enzyme 
activity® 
WT 0.28 + 0.01 1  0.003-—0.005 100 
A(1-3) 3.4+0.1 12 0.007—0.010 76 
A(1-4) 57.5 + 6.3 205 0.007—0.017 1.3 
A(1-5) 35.4 + 2.2 126 0.007-0.013 0.3 
A(1-6) 151 + 0.4 539 0.003-—0.007 0.3 
A(1-7) 857 + 19 1,275 0.006—0.010 0.4 
A(293-306) 163 + 3.2 582 0.005-0.009 11 
A(201-306) 7870 + 302 28,107 0.005-0.008 0.2 


* Dissociation constant of the dimeric SARS-CoV main proteases. 

>’ Root mean square deviation of the fitted sedimentation velocity 
data. 

© Enzyme activity assay performed with the internally quenched sub- 
strate ortho-aminobenzoic acid-TSAVLQSGFRK-2,4-dinitrophenylam- 
ide. The enzyme activities of the various truncated mutants were too 
small to allow a precise determination of the kinetic parameters. Only 
the relative enzyme activity was reported. 


between dimerization and enzyme activity. We used the CASTp 
program (389) to analyze the pockets and cavities of the enzyme. 
Only three of the cavities with mouths were big enough to be 
significant (Fig. 1C). One of these pockets was located at the 
interface between domains II and III with a surface area of 2.82 
nm” and has no contact with protomer B (Fig. 1C, yellow 
pocket). The two largest pockets were located at the subunit 
interfacial region. The N-terminal finger and C-terminal tip 
were sited at the pocket with a solvent-accessible surface area 
of 18.41 nm? and a volume of 2.173 nm? (Fig. 1C, purple 
pocket). The amino acid residue contacts between subunits A 
and B were analyzed with the Contacts of Structural Units 
software (40). Phe-3 in subunit A contacts with subunit B with 
only one destabilizing hydrophobic-hydrophilic contact. How- 
ever, Arg-4 (A) extends deeply into subunit B (Figs. 1 and 7) 
involving six destabilizing contacts. It is then clear that re- 
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SARS-CoV Main Protease 


Protomer A Protomer B 


Fic. 7. Interfacial regions of the dimeric SARS-CoV main pro- 
tease. The interfacial contact surface regions within 1 nm for both 
protomers A and B are shown in mesh model form. The same surface 
areas were turned 90° around the x-axis to give C and D, respectively, 
from A and B. The C-terminal helical region (red) and the N-terminal 
finger (blue) are shown in stick mode. This figure was generated with 
Spock software. 


moval of Arg-4 has great impact on the structure of the enzyme 
molecule. 

All of the coronavirus conserved an extra large C-terminal 
a-helical domain that is not found in other RNA virus 3C-like 
proteases (18, 15). The precise biological role of the a-helical 
domain is very interesting but still not completely understood. 
Our experimental data indicate that the C-terminal domain 
plays an important role in dimerization and enzyme activity. 
These results are consistent with the previous report that loss 
of the extra domain III of the SARS-CoV main protease induced 
monomer formation and loss of enzyme activity. A further step 
forward, we have narrowed down the region to a single a-helix. 

A similar thermal stability curve was observed for most 
truncations, except for the last a-helically truncated protease, 
A(293-306), which showed significantly lower T,,, than WT 
(Table I). Deletion of the whole domain III, on the other hand, 
restores the stability of the molecule. These results imply the 
functional role of domain III of the protease. The deleted last 
helical segment is located at a large pocket between domains IT 
and III (Fig. 1C, yellow pocket). This helix is essential for 
protein stability of the whole molecule. Without this helix, the 
remaining domain III becomes a burden for domains I and II 
that renders the protease unstable. 

With a complete domain III, the full-length SARS main pro- 
tease was in monomer-dimer equilibrium with dimer as the 
major form, even in very low protein concentration (0.1 mg/ml) 
(17). This result is different from some of the recent reports (18, 
41), which showed a major monomer in a protein concentration 
of <0.2 mg/ml. This discrepancy is due to technical differences 
used in characterizing the quaternary structure of the protein. 
It is extremely important to obtain an unequivocal answer to 
this question, because the dissociated monomer might be en- 
zymatically inactive. To study the protein self-association, we 
used the rebirth state-of-the-art analytical ultracentrifugation 
technique. The K, value determined by analytical ultracentrif- 
ugation (0.28 um), however, is ~360 and 810 times smaller 
than that previously estimated from the analytical gel filtra- 
tion experiment and isothermal titration calorimeter, respec- 
tively (19, 42). Our K, value for the monomer-dimer equilib- 
rium of SARS-CoV main protease was obtained from global 
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analysis of three different protein concentrations. Under the 
stringent conditions in analytical ultracentrifugation, we be- 
lieve that the K, obtained is more reliable (33). 

Our data showed that the N-finger of SARS-CoV main pro- 
tease was indispensable for proteolytic activity, which was 
consistent with previous findings (19). The contribution of the 
N-finger in dimerization was clearly demonstrated by analyti- 
cal ultracentrifugation analysis (Fig. 4). As shown in Fig. 7, the 
N-finger from protomer A is completely buried into protomer B 
(Fig. 7, A and B). We have further narrowed down to a single 
residue, Arg-4, which plays a pivotal role in building the mo- 
lecular interaction (Fig. 4). The correlation between enzymatic 
activity and dimerization is also demonstrated. The fourth 
residue plays a critical role in dimerization of SARS-CoV main 
protease, which is essential for the enzymatic activity. 

In conclusion, both the N- and C-terminal regions play piv- 
otal roles in controlling the dimerization and activity of the 
enzyme. Our study provides fundamental information for the 
novel design of inhibitors of the SARS-CoV main protease 
activity by disrupting its dimerization interface. 
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